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FOREWORD 


This  is  the  first  in  a  series  of  papers  dealing 
with  the  problem  of  finding  the  optimal  decision  rules 
for  n -period  chance-constrained  programming  mod¬ 
els.  The  significant  feature  of  this  paper  is  that  the 
admissible  class  of  decision  rules  is  not  required  to 
possess  any  specific  analytic  property,  as  it  is  in  all 
previous  work  in  this  field.  Instead  the  admissible 
class  is  the  largest  possible  class  of  decision  rules 
consistent  with  the  interpretation  given  to  n -period 
problems.  It  is  shown  that  in  this  case  the  optimal  de¬ 
cision  rules  are  piecewise  linear  functions  of  certain 
conditional  fractile  points  and  the  decision  rules  of  all 
preceding  periods. 
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Optimal  Decision  Rules 
for  the  E  Model 

of  Chance-Constrained  Programming 


ABSTRACT 


The  first  five  sections  of  this  paper  contain  an  intro¬ 
duction  to  the  topic  of  chance -constrained  programming. 
Then  the  general  n -period  expectation -objective  model  of 
chance -constrained  programming  is  presented  and  certain 
necessary  conditions  are  established  for  decision  rules  to 
be  optimal  for  such  a  model.  The  question  of  the  consisten¬ 
cy  of  the  constraints  and  the  finiteness  of  the  optimal  value 
of  the  objective  function  for  such  problems  is  discussed  and 
several  methods  of  resolving  these  questions  are  presented. 
The  simplification  that  results  when  the  chance -constrained 
problem  is  treated  as  a  problem  of  linear  programming  un¬ 
der  uncertainty  is  also  discussed.  The  paper  is  concluded 
by  solving  two  two-stage  problems. 
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1.  INTRODUCTION 


As  an  introduction  to  the  topic  of  chance-constrained  programming,  the 
various  kinds  of  objective  functions  and  admissible  classes  of  decision  rules 
for  chance-constrained  problems  that  have  appeared  in  the  literature  are  dis¬ 
cussed  in  this  paper,  and  remarks  are  illustrated  by  using  the  stochastic  heat¬ 
ing-oil  problem.  The  general  n -period  expectation-objective  model  is  then 
presented  and  optimal  decision  rules  for  such  a  model  are  shown  to  be  piece- 
wise  linear  functions  of  certain  conditional  fractile  points.  Using  these  results, 
the  simplification  obtained  by  treating  the  problem  as  one  of  linear  programming 
under  uncertainty  is  discussed.  The  paper  is  concluded  by  solving  two  rathei 
simple  two-stage  problems. 

The  topic  of  chance-constrained  programming  is,  perhaps,  best  introduced 
by  first  examining  an  ordinary  linear-programming  problem  in  its  most  gen¬ 
eral  form,  i.e., 


maximize 


crV 


subject  to 


AV  £  h 


(1) 


where  A  is  an  m  >  n  matrix  of  constants,  and  b,c  are,  respectively,  m  *  1  and 
n  x  l  constant  vectors.  The  object  of  problem  1  is  to  find  the  n  *  1  vector  X 
from  the  set  of  all  X  satisfying  AX  <  b,  which  maximizes  cTX. 

If  a  chance-constrained  formulation  were  to  replace  problem  1  a  problem 
of  the  following  type  would  be  obtained: 


maximize 
subject  to 


ft  c .  \) 

r< A\  <  b>  2  a, 


(2) 


where  P  means  probability.  Here  A,  b,c  are  no  longer  constant  matrices  as 
they  were  in  problem  1;  rather,  they  can  have  some  or  all  of  their  elements  as 
random  variables.  The  m  >  1  vector  a  contains  a  prescribed  set  of  constants 
that  are  probability  measures  of  the  extent  to  which  constraint  violations  are 
permitted.  In  other  words,  an  element  a,  t  a  and  satisfying  0  s  a,  ^  1  is  asso¬ 
ciated  with  the  constraint 


to  give 


.?.*•**• 1  *■ 


Pl  5.  a  i.  <  t>. )  2 


i*  i 


ii  t  '  ' 


(3) 
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This  means  that  the  i  th  constraint  may  on  occasion  be  violated,  but  it  can  be 
violated,  at  most,  100(1— ot,)  percent  of  the  time. 

As  yet  nothing  has  been  said  about  the  nature  of  the  functional  of  problem 
2.  As  might  be  expected,  a  fairly  wide  range  of  reasonable  choices  can  be  con¬ 
sidered  for  the  function  f  ( c ,  X ).  In  Section  3  some  of  these  choices  will  be  dis¬ 
cussed  and  the  work  that  has  been  done  on  them  will  be  summarized. 

An  important  question  in  chance-constrained  programming  concerns  the 
classes  of  decision  rules  that  should  be  admissible  for  the  problem.  Because 
the  object  in  problem  2  is  to  find  an  optimal  vector  of  stochastic  decision  rules 
\  =  <0  (A,  b,  c)  with  the  function  <p  chosen  from  some  prescribed  class  of  functions, 
a  fundamental  question  is  how  to  choose  the  admissible  class  of  functions  for 
( p .  In  all  the  literature  published  thus  far,  it  is  assumed  that  <p  is  a  linear  func¬ 
tion  of  the  elements  of  b,  i.e.,  X  is  restricted  to  a  linear  decision  rule.  In  the 
results  of  this  paper  the  admissible  class  of  decision  rules  is  greatly  enlarged 
and  then  the  optimal  decision  rules  are  shown  to  be  piecewise  linear.  This 
result  is  of  particular  importance  because  it  shows  that  there  exists  good 
reason,  other  than  the  fact  that  it  is  mathematically  more  manageable,  to  limit 
oneself  to  finding  the  optimal  linear  rule. 

In  some  cases  the  class  from  which  <p  is  to  be  chosen  may  be  restricted 
deliberately,  so  that  at  the  time  of  application  of  the  decision  rules  X  is  not  a 
function  of  some  random  variables;  rather,  the  resulting  X  values  are  assigned 
numerical,  i.e.,  deterministic,  values.  When  this  can  be  done  it  is  said  that 
the  resulting  function  provides  a  set  of  certainty-equivalent  relations  because 
it  specifies  the  decision  exactly,  i.e.,  with  certainty. 

In  general,  however,  we  permit  a  choice  of  ofor  which  the  resulting  X 
vectors  are  also  random.  Thus  problem  2  is  solved  for  a  set  of  stochastic 
decision  rules  imputing  action  for  each  manifestation  of  the  random  variables 
involved  in  the  p~oblem.  This  is  in  contrast  to  other  possible  solution  notions 
such  as  mixed  strategies  in  game  theory  or  Manne's  probleml-which,  it  might 
be  noted,  corresponds  to  the  solution  of  only  the  first  problem  of  the  two  into 
which  the  chance-constrained  problem  was  factored  in  Charnes  and  Cooper*— 
or  Tintner’s  stochastic  programming.* 

In  either  the  stochastic  or  certainty  equivalent  case  it  may  also  be  pos¬ 
sible  to  develop  a  deterministic  equivalent  for  problem  2.  That  is,  it  may  be 
possible  to  find  a  problem  that  is  equivalent  and  yet  does  not  contain  any  of  the 
random  variables  explicitly.  Hence  it  is  a  deterministic  (often  convex)  pro¬ 
gramming  problem. 

These  deterministic  equivalents,  when  attainable,  often  provide  a  means 
of  convenient  analysis  and  characterization  of  the  solutions  of  the  given  chance- 
constrained  problem.  In  addition  they  may  also  be  valuable  in  their  own  right 
for  purposes  of  theoretical  analysis  and  policy  review.  How  to  obtain  deter¬ 
ministic  equivalents  for  certain  choices  of  f  (c,  X  )  when  the  admissible  class 
of  functions  for  <p  is  the  class  of  linear  functions  of  b  is  illustrated  in  Section  3. 

2.  A  CHANCE-CONSTRAINED  EXAMPLE:  THE  STOCHASTIC 

HEATING-OIL  PROBLEM 

In  order  to  elaborate  further  on  the  ideas  developed  above,  an  example  of 
a  chance-constrained  problem  will  be  given  by  using  a  simplified  version  of 
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the  problem  that  first  save  rise  to  the  chance-constrained  programming  formula 
tion  and  application.4  The  problem  is  one  of  multiperiod  scheduling  of  heating- 
oil  production  to  meet  an  uncertain  demand  that  depends  heavily  on  the  weather. 
In  the  original  problem  the  constraints  included  marketing  constraints,  which 
required  production  to  be  planned  in  such  a  way  that  customer  demands  were 
met  as  they  materialized,  and  storage  constraints,  which  restricted  the  amount 
of  oil  that  could  be  kept  in  inventory.  The  objective  of  the  problem  was  to 
maximize  expected  profits  over  the  entire  planning  horizon. 

As  Charnes,  Cooper,  and  Symonds  showed,'  a  deeper  consideration  of  the 
constraints  suggested  that  the  problem  was,  in  fact,  one  of  minimization  of 
expected  costs  rather  than  maximization  of  the  expected  profits.  Because  the 
constraints  of  the  problem  were  such  that  customer  demand  was  to  be  taken 
as  given,  i.e.,  stochastically  determined,  the  real  objective  was  to  supply  what¬ 
ever  demands  arose  at  a  minimum  total  expected  cost. 

A  simplified  version  of  this  problem  is: 


minimize 


subject  to 


<SR, 


W 


p  <  I 


f 

1  Ri 
i  i  1 


t 

S  SI  :  Offo,  .  .n 

I  «  I  1 


P(R  J  0)  >  1 


I  1 . ", 


(4) 


where  the  symbol  E  refers  to  the  expectation  operator,  so  the  objective  is  ex¬ 
pected  value  minimization  over  the  n  periods  of  the  planning  horizon  The  c, 
and  k,  represent,  respectively,  the  refinery  costs  and  the  inventory  carrying 
charges  for  period  j ;  both  c,  and  k, ,  )  =  1 ,  .  .  .  ,n,  may  be  random  variables. 

I,  is  the  initial  inventory  at  the  start  of  period}.  The  random  variable  S( 
represents  the  anticipated  sales  for  period}.  R,  represents  the  production 
rate  to  be  scheduled  in  period  |  and  is  the  quantity  for  which  a  decision  rule  is 
to  be  found. 

Thus  problem  4  shows  that  minimization  of  the  total  expected  production 
costs  over  periods  j  -  1,  .  .  .  ,n  is  desired,  subject  to  meeting  customer  de¬ 
mand  at  least  lOOor,  percent  of  the  time  in  each  period. 

The  probability  a,  is  known  as  the  “risk  level*  associated  with  the  tth 
constraint.  The  quantity 

£>'• 

is  called  the  “quality  level.*  Although  in  problem  4  these  quality  levels  include 
all  sales,  it  is  important  to  note  that  this  need  not  always  be  the  case.  For 
example,  we  may  adjust  the  original  random  variables  so  that  b*  refers  to  a 
certain  fraction  of  the  original  sales  variables.  This  would  not  change  the 
problem  in  any  mathematically  essential  way  but  it  would  alter  our  interpreta¬ 
tion  of  the  problem.  Thus  the  constraints  could  be  changed  tc  mean  that  in 
period  > ,  1000/  percent  of  all  sales  demands,  plus  or  minus  a  safety  margin, 
must  be  met  at  least  100a*  percent  of  the  time.  This  flexibility  in  interpreta- 


5 


tion  of  mathematically  similar  models  is  one  of  the  significant  features  of 
chance-constrained  programming.  In  solving  problem  4  it  is  convenient  to 
find  decision  rules  such  that  the  actual  refinery  production  rates  will  be  known 
exactly  at  the  start  of  the  period  to  which  they  apply.  In  other  words,  it  is 
desired  that  the  stochastic  decision  rules  Rj,  j  =  1,  .  .  .  ,n,  be  such  that  Rj  is 
a  known  number  at  the  start  of  the  jth  period.  This  means  that  the  admissible 
class  of  decision  rules  for  R(  can  involve  random  variables  only  insofar  as 
their  values  will  be  known,  i.e.,  will  have  been  observed,  at  the  time  of  appli¬ 
cation  of  R,.  Hence  R,  can  be  an  explicit  function  of  only  the  random  variables 
of  periods  1,  ...,)-  1,  and  it  cannot  be  a  function  of  the  random  variables  of 
the  j  th  or  future  periods. 

In  particular,  if  consideration  is  limited  to  linear  rules,  the  rules  will  be 
of  the  form  R~  TS  +  6,  where  T  is  a  lower  triangular  matrix.  Specifically,  we 
can  write 

^  "  ,?i yp»Si  *  f_1 . "  (5) 

where  each  y/(  ,  6/  is  a  scalar  and  yio  =  0  =  Yu. 

In  their  work,  Charnes,  Cooper,  and  Symonds4  were  able,  by  using  Eq  5, 
to  convert  problem  4  into  a  deterministic  equivalent,  which  was  then  solved  to 
give  the  optimal  values  and  bi  for  Ytt  and  6/,  respectively.  Using  these 
values,  the  optimal  decision  rules  R,,j  =1,  .  .  .  ,«,  for  problem  4  are  given  by 

«;  - 

r;  * 

R*3  - 

and,  in  general, 


It  is  important  to  observe  that  different  yr,  weights  apply  to  the  same 
observed  value  of  S, .  Thus  Sl  need  not  (and  in  general  will  not)  receive  the 
same  weight  in  Rj  as  it  did  in  R* .  But  once  y,t ,  6*,  |  >fc  ,  j  =  1,  .  .  .  ,n,  have 
been  found,  Eq  5  supplies  a  set  of  relations  (i.e.,  decision  rules)  that  can  be 
used  to  generate  the  required  numbers  (i.e.,  decisions)  at  the  time  these  num¬ 
bers  are  needed.  Thus,  as  the  observations  on  the  preceding  S,  values  are 
obtained,  the  resulting  Rj  values  will  be  known  with  the  certainty  the  problem 
requires. 

The  certainty-equivalent  relations  discussed  above  may  supply  all  that  is 
required  for  the  conduct  of  operations.  They  may  not,  however,  meet  all  the 
needs  of  management,  since  management  may  want  to  evaluate  different  alter¬ 
natives  before  committing  itself  to  a  given  set  of  policies  or  actions.  Since 
Eq  5  cannot  be  completed  until  the  S,  values  have  been  observed,  this  evaluation 
of  different  alternatives  cannot  be  achieved.  However,  when  a  deterministic 
equivalent  that  assumes  the  form  of  a  linear  or  convex  programming  problem 
is  available,  the  duality  relations  of  such  problems  can  be  used  to  supply  dual 
evaluators.  Then,  prior  to  obtaining  the  data  needed  to  achieve  the  certainty- 
equivalent  relations,  it  is  possible  to  study  the  overall  effects  of  variations  in 


yu- s.  * 

YsVl  *  yj2S2  +  63* 


fc-l  y'k  ‘  +  S> 
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risk  levels,  quality  levels,  etc.,  and  in  other  forms  of  constraint  alt*  mtion  and 
data  testing. 

This  completes  the  discussion  of  the  stochastic*heating-oil  prot  lem.  It 
has  been  examined  at  some  length  in  order  to  illustrate  one  possible  «rea  of 
application  of  the  results  developed  in  Sections  6—10  of  this  paper  an  I  in  Ref  5. 
However,  it  is  emphasized  that  this  is  not  the  only  area  to  which  char.ce-con- 
strained  programming  has  been  successfully  applied.  Charnes,  Coop  r,  Devoe, 
and  Learner®  consider  a  problem  of  selecting  studies  and  statistical  <  stimators 
in  new-product  marketing  studies.  Chance-constrained  programmin  ;  is  also 
very  useful  in  problems  of  financial  budgeting  and  planning  for  liquid  ty.  This 
is  illustrated  by  Charnes  and  Thore7  when  they  consider  the  problem  of  plan¬ 
ning  for  liquidity  in  a  savings  and  loan  association.  Their  results  are  extended 
further  in  a  RAC  paper  now  in  preparation.8  Also,  the  investigation  of  the  uses 
of  chance-constrained  programming  in  critical  path  analysis  was  begun  recently 
by  Charnes,  Cooper,  and  Thompson.9 

3.  SOME  CHOICES  OF  OBJECTIVES  IN  CHANCE- CONSTRAINED 

PROGRAMMING 

As  mentioned  in  the  “Introduction*  section,  there  are  many  possible 
choices  for  the  function  f  (c ,  X  )  in  problem  2.  In  fact,  it  must  be  emphasized 
that  the  very  concept  of  optimization  under  risk  and  uncertainty  immediately 
raises  important  questions  about  the  choice  of  rational  objectives.  Such  ques¬ 
tions  arise,  for  example,  concerning  the  reasonableness  of  an  ‘■'xpected-value 
optimization.  Why  not  choose  some  other  measure  of  value?  The  in,|>r  rtance 
of  this  question  becomes  clear  when  it  is  noted  that  the  decision  rules  that  are 
optimal  for  one  problem  and  a  given  objective  will  not,  in  general,  lie  optimal 
for  the  same  problem  under  a  different  objective. 

Charnes  and  Cooper10’11  investigate  three  different  classes  of  objectives; 
specifically,  they  examine  (a)  an  expected- value  optimization,  (b)  a  minimum- 
variance  (or  mean-square  error)  objective,  and  (c)  a  maximum-probability 
model.  These  models  are  called,  respectively,  the  “E  model*  “V  model,*  and 
*P  model*  of  chance-constrained  programming. 

It  is  of  interest  to  distinguish  between  the  first  two  models  and  the  third 
by  reference  to  what  H.  A.  Simon12  calls  the  “satisficing,"  as  opposed  to  the 
“optimizing,*  objective.*  Simon  originally  proposed  this  objective  as  an  lter- 
native  to  (a)  and  (b)  in  order  to  try  to  resolve  some  of  the  inadequacies  of  op¬ 
timizing  objectives  for  characterizing  certain  aspects  of  human  behavior.  In 
particular,  he  suggested  that  human  beings  do  not  always  seek  an  absolute 
extremum  before  taking  action  in  a  given  situation;  rather,  they  try  to  maximize 
the  probability  of  being  better  off  than  some  given  point  of  reference. 

In  the  P  model,  vectors  c',,l  V’,  are  specified  relative  to  some  set  of  values 
that  an  organism  or  human  being  will  regard  as  satisfactory  whenever  these 
levels  are  achieved.  Of  course  the  organism  confronted  by  an  environment 
subject  to  risk  cannot  be  certain  of  achieving  the  given  level  C’1  V’  when  effect¬ 
ing  its  choice  from  what  it  believes  to  be  the  available  alternatives.  Therefore 

tSee  also  March  and  Simon,15  and  the  extensions  and  applications  to  problems  of 
budgetary  management  given  in  Cooper  and  Savvas,14  Stedry,15  and  Stedry  and  Charnes.16 


it  tries  to  maximize  the  probability  of  obtaining  at  least  its  desired  level  coT  \° 
subject  to  its  feasible  alternatives  of  action. 

In  their  papers,  Charnes  and  Cooper10’11  express  the  E  model  as  follows: 


maximize 
subject  to 


E(ctX  ) 

P ( AX  <  b)  >  a. 


(6) 


where  A  is  a  matrix  of  constants  and  b,  c  are  vectors  of  random  variables. 
They  solve  problem  6  for  the  optimal  linear  rule  given  by  X  s  Db ,  and  so  con¬ 
vert  the  problem  to  one  of  finding  the  elements  of  the  matrix  D.  Under  suitable 
assumptions  concerning  the  symmetry,  and  the  existence  of  second  moments, 
of  certain  distributionst  they  show  that  the  deterministic  equivalent  of  problem 
6  is 


minimize 

-  "JtVb 

subject  to 

-  a!D>i b  -  •'i  -  -  i*b,  •  ,  =  1 . *■ 

-  k*  E[  b  -  oTDb  I*  ♦  v?  >  0,  i=l . m,  0) 

U|  I  I  I  r 

e,  2  0,  i=l . hi, 

where  ptr  =  iE(e)]F,  ,.E (b) ]T, afr  is  the  ith  row  of  A,  b  =  b  -  and  -  K'o,  s 
F,"1  (a, )  where  F(  (  .  )  is  the  cumulative  distribution  function  of  the  random 
variable  z,  =  (b,  -  a/  Db  )  /  E  Lb,  -  a,rDb  ]*.  This  is  a  deterministic  convex  pro¬ 
gramming  problem  in  the  variables  v,  ,  i  =  1 . .  and  the  elements  d,f  of  D. 

Using  the  same  kinds  of  techniques  and  limiting  themselves  to  finding 
linear  decision  rules,  Charnes  and  Cooper10’11  found  deterministic  equivalents 
for  the  V  model: 


minimize 

subject  to 

E  ( c T X  -  c® T  X®  )2 

P(  AX  _  b )  > 

X  >  Db, 

where  c‘,T, 

\°  are  given  vectors;  and  for  the  P  model 

maximize 

IMc’V  '  c®1  X®) 

subject  to 

P( AX  :  b)  2  a 

tFor  similar  developments  for  other  kinds  of  distributions  and  where  I)  is  spe¬ 
cialized  in  various  ways  see  Ben-Israel17  and  Ben-Israel  and  Charnes.1® 
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In  both  these  models  the  resulting  deterministic  equivalents  contain  the  con¬ 
straints  cf  problem  7  as  part  of  their  system  of  constraints,  although  they  also 
have  additional  constraints  and  a  different  objective  function. 

In  the  problem  to  be  considered,  starting  in  Section  6,  the  expectation- 
objective  formulation  will  be  used.  No  answer  is  attempted  to  the  question  of 
whether  it  is  the  “right*  objective  to  use,  but  it  seems  to  be  a  “reasonable* 
objective  for  problems  of  planning  over  an  n-period  horizon. 


4.  CHOICE  OF  DECISION  RULES 

The  developments  of  the  following  sections  are  concerned  with  establishing 
certain  properties  of  the  optimal  decision  rules  for  the  general  n-period  E- 
objective  model.  In  Section  6,  the  admissible  class  of  decision  rules  for  the 
problem  is  defined  as  the  most  general  class  of  rules  consistent  with  this 
interpretation  of  n-period  problems.  However,  in  order  to  illustrate  clearly 
how  closely  the  choice  of  an  admissible  class  is  related  to  this  interpretation 
of  the  constraints  of  the  problem,  problem  4  must  be  considered  again.  There 
it  was  seen  that  a  choice  of  the  admissible  class  to  be  R  -  TS  +  6,  where  T  is 
lower  triangular,  led  to  a  constraint  interpretation  of  the  following  kind: 

t-i  (~i 

P(lp  +  Rp£Sf!ltf*  Sj  5  Ip)  _  .n,  (g) 

that  is,  given  the  observations  S/,  j  =  1,  .  .  .  1,  and  decisions  R,,  j  =  1,  .  .  .  , 

/  -  1,  on  the  right  of  the  vertical  stroke  in  Eq  8  (the  stroke  is  used  to  suggest 
conditional  probability),  and,  before  S/  has  been  observed,  a  value  must  be 
found  for  R/  that,  when  added  to  the  'th  period’s  beginning  inventory  I  t ,  will 
be  sufficient  to  meet  the  unknown  demand  S/  with  at  least  the  specified  proba¬ 
bility  or. 

Clearly  then  other  choices  of  the  matrix  T  will  lead  to  (or  be  associated 
with)  different  ways  of  interpreting  the  constraints.  This  would  be  true  if,  for 
example,  R^  were  permitted  to  be  an  explicit  function  of  S|. 

This  problem  can  be  illuminated  further  by  considering  some  of  the  work 
done  by  Ben-Israel17  on  what  will  be  called  the  “zero-order  decision  rule.* 

Here  the  decision  maker  wants  to  know  all  his  program  values  in  advance, 
i.e.,  at  the  start  of  the  planning  period.  Such  a  situation  can  arise,  for  example, 
in  some  aspects  of  short-term  country  development  planning  as  well  as  in 
certain  kinds  of  budgetary  planning  practice. 

For  this  particular  case  Ben-lsrael’s  theorem17  is  employed  to  obtain 
the  result 


if  and  only  if 


r<  i 

i=  * 


fl|t  *  j 


ib': 


j  ,  aM  *)  "  ^1  *  '  Ql  '> 
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where  F,  (  .  )  is  the  cumulative  marginal  distribution  function  of  the  random 
variable  b,  and  y  is  identical  with  F"1  (1  -  q)  if  and  only  if  y  equals  maximum 
v :  F,  (y)  <  1  -  o, }.  Thus  the  chance-constrained  problem 


maximize 
subject  to 


Etc TX) 

IM  AX  i  b)  >  a  , 
V  >  0 


has  as  its  deterministic  equivalent 
maximize 

subject  to 

AV  2  F  ~  1  (  1  -  a  >  , 
X  2  0. 


(9) 


where  uj  =  E  ( c r). 

Ir  a  similar  manner  the  vector  G-1  (ft)  can  be  defined  as  such  that 


f>  U  1  A  '  c  1  )  2  ft 

if  and  only  if 

r  A  2  (  G  ~ 1  (  ^  > )  r 

Thus  if  G(  (  .  )  is  the  cumulative  marginal  distribution  iunction  of  c, ,  the  j  th 
component  of  cr,  then  y  =  (i~‘(ft)  if  and  only  if  y  min  i y : G, ( y )  ftj. 

Therefore  the  deterministic  equivalent  of 


minimize 
subject  to 

is:  minimize 
subject  to 


E<u,’b) 

JM  U  1  A  2  C  :  )  2  ft  . 
2  0 

o'  A  2  l  (,-'(£))  I1  . 

UJ  _1  0  , 


where  uh  ~  E (b ). 

The  result  obtained  is  the  same  as  that  of  Ben-Israel,  i.e.t  each  of  the 
following  two  problems  is  the  dual  of  the  other: 


maximize 

I  g-1  (  ft)  iT\ 
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subject  to 


P( AX  1  b)  >  a . 

no 


and  minimize 


subject  to 


<*1  F~l ( l-a) 

P(oi T A  2  cT)  >  P  , 
^  0  . 


This  example  clearly  illustrates  that  the  choice  of  decision  rules  here 
(or,  alternatively,  the  interpretation  of  the  constrain*-)  markedly  affects  the 
resulting  mathematical  treatment  of  the  model.  For  example,  in  the  case  of 
zero-order  decision  rules  it  is  possible  to  develop  a  duality  theory  along  the 
lines  of  the  foregoing  work,  making  the  solution  and  evaluation  of  such  problems 
much  simpler  than  when  much  larger  classes  of  decision  rules  are  considered. 

It  would  be  possible  to  extend  this  discussion  further  by  investigating 
other  special  and  interesting  cases.  Instead,  however,  n-period  problems  will 
be  considered  again  and  interpreted.  In  the  following  developments  the  inter¬ 
pretation  first  suggested  in  Section  2  will  be  used.  That  is,  the  admissible 
class  of  rules  for  the  |  th-period  decision  rule  can  include  explicitly  only  the 
random  variables  whose  values  will  have  been  observed  when  the  time  comes 
to  put  the  |  th  decision  into  effect.  Thus,  as  in  the  heating-oil  example,  the  (th- 
period  decision  rule  will  be  a  known  number  at  the  start  of  the  jth  period. 
However,  unlike  the  developments  outlined  in  Section  3,  no  other  restrictions 
will  be  placed  on  the  rules.  Hence  a  much  larger  class  of  rules  than  the  set 
of  linear  rules  already  discussed  will  be  considered,  and  yet  the  admissible 
class  will  include  the  class  of  linear  rules. 


5.  MULTIPERIOD  CHANCE -CONSTRAINED  PROBLEMS 

Before  proceeding  with  the  solution  of  the  general  n-period  .--objective 
problem,  a  brief  survey  will  be  presented  of  the  literature  on  n-period  problems 
as  it  pertains  to  this  discussion.  As  has  been  mentioned,  the  work  of  Charnes, 
Cooper,  and  Symonds*  is  fundamental  in  the  field.  In  addition  the  work  of 
Symonds19  on  stochastic  scheduling  by  the  horizon  method  and  Charnes  and 
Thore7  on  liquidity  planning  are  mentioned. 

If  the  special  case  of  chance-constrained  programming,  which  is  obtained 
by  making  all  the  constraints  hold  with  probability  one,  is  considered,  the  prob¬ 
lem  obtained  is  one  known  in  the  field  as  “linear  programming  under  uncertainty." 
Such  problems  were  first  discussed  by  Dantzig*°in  relation  to  what  he  called  the 
“n-stage  problem."  This  emphasis  on  staging  does  not,  however,  provide  a 
wholly  adequate  characterization  of  the  problem,  for— as  was  shown  by  Charnes, 
Cooper,  and  Thompson21— the  n-stage  problem  of  Dantzig  can  be  converted  into 
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a  one- stage  ordinary  minimization  problem.  Thus,  whereas  the  staging  em¬ 
phasis  may  appear  to  have  a  certain  appeal  in  distinguishing  between  this  and 
other  approaches  to  programming,  when  statistical  errors  are  present  in  the 
matrices  (as,  for  example,  in  Madansky***  *s ),  it  also  tends  to  conceal  the  sim¬ 
pler  characterizations  of  the  problem. 

Specifically,  Charnes,  Cooper,  and  Thompson*1  established  the  following 
important  result: 

Theorem  1 

Consider  the  problem 
maximize 

E(f<X, . X„;  b,...  .b„), 

subject  to 

i 

S  <.  =  b  i=l . m 

l  =  i  ' 

X,  >  0 

where  the  X(  are  piecewise  analytic  functions  of  the  b,  and  X, ,  and  f  is  a  linear 
function  of  \|,  .  .  .  ,  X„.  Then  the  optimal  decision  rule  for  X;  it'  a  piecewise 
linear  function  of  b,,  .  .  .  ,b,  and  X,. 

In  the  following  work  a  similar  result  will  he  obtained  as  an  immediate 
corollary  to  the  main  theorem. 

6.  THE  GENE RAI  ft -PERIOD  E  MODE! 

The  problem  to  be  discussed  is  the  following: 
maximize 

EU'X,  .  <\  \2  •  c,TX, . c'n\n) 

subject  to 


p<b,  i 

'll 

)  i  «,. 

p  ( b,  -• 

s. 

X, 

*  A  21 

^2 

)  i  Oj 

P(b,  - 

a3, 

X. 

^2 

*  AJ5Xj'  -  a  3 

P(bn  . 

X, 

*  *r%2 

A  2 

*  AUX3  * 

x, 

J  0 

|=l - ft 

where  P  stands  for  probability  and  E  denotes  the  expected- value  operator. 
(A  more  detailed  explanation  of  these  follows.) 
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In  problem  10  it  is  assumed  that — 

(a )  Al(  ,i  i  ~  1 ,  .  .  .  , n,  is  an  m(  r  n(  matrix  all  the  elements  of 

which  are  known  constants. 

(b)  ,n,  is  an  m,  >  1  vector  in  which  each  element  is  a 

prescribed  probability,  i.e.,  a J  (a,,,  .  .  .  ,oim  )  is  such  that  0  ■  olfc  *  1  for 

k  -  1 . m,,  i  =  1 . n. 

(c)  ( Cj . . c,n  ),  |  -  1 . .  is  a  1  '  n,  vector  of  random 

variables. 

(d)  bj  (  bu,  ....  bim|  I,  i  -  1,  ...  ,n,  is  a  1  *  m,  vector  of  random 
variables. 

(e)  the  joint  distribution  of  the  blk  and  c.t>,  k  1 . m,,f  1 . n,,j, 

I  ■  1,  .  .  .  ,n,  is  known,  i.e.,  the  multivariate  distribution  function  of  the  b,fc  and 
ctf  is  assumed  to  be  a  known  function. 

In  addition  to  these  five  assumptions  on  the  properties  of  the  matrices 
involved  in  the  problem,  some  conditions  on  the  way  in  w’Mch  the  problem  is 
to  be  solved  will  be  imposed  The  nature  of  these  additional  assumptions  is 
dictated  by  the  interpretation  of  the  problem  in  this  paper.  Problem  10  will 
be  treated  as  an  n-period,  or  n- stage,  problem  in  which  \, ,  the  vector  of  de¬ 
cision  rules  for  the  |th  stage,  is  selected  with  knowledge  of  all  decisions 
\i,  .  .  .  ,  and  observations  of  the  vectors  of  random  variables  b(,  c, ,  i  1, 

.  .  .  , |  —  1 ,  but  before  b( ,  i  ,  and  all  random  variables  of  periods  succeeding  the 
|th  are  observed.  Thus  X,  will  be  known  exactly  (i.e..  will  be  a  known  number) 
at  the  start  of  the  |th  period. 

In  other  words,  we  must  select  ,  our  vector  of  first-period  decisions, 
before  observing  the  value  of  the  first-period  random  vector  b,  and  the  first- 
period  cost  vector  c,  Then,  having  selected  X,  and  observed  b,  and  <  the 
second-period  decision  rules  \,  must  be  chosen  before  observing  the  values  of 
b..  and  c;.  This  process  continues  for  n  periods  with  X,  depending  only  on  X,  b, 
i,  .i  =  I, ....  I  - 1,  but  not  on  b( ,  c(  or  \  ,  b, ,  c, ,  i  i  +  1,  .  .  .  ,n.  This  interpreta¬ 
tion  means  that  the  information  at  any  stage  (aside  from  a  knowledge  of  the 
joint  distribution  of  b( ,  c(  ,  i  ,  |  r  1,  .  .  .  ,n)  is  limited  to  a  knowledge  of  the  de¬ 
cisions  and  observations  of  the  preceding  stages.  Thus  it  is  assumed  that— 

(f)  the  *  1  vector  X,,  |  ~1,  .  .  .  , is  a  function  of  X(  ,  b(,  c, ,  for  i  1 . 

I  —  1 ,  but  is  not  a  Junction  of  the  remaining  decisions  \t , i  _  |*T,  .  .  .  ,n,  or  of 

random  vectors  b, ,  c, ,  i  I,  .  .  .  ,n. 

X,  will,  of  course,  depend  implicitly  on  the  decisions  and  observations  of 
succeeding  stages  because  of  the  coupling  nature  of  the  constraints  of  problem 
10.  Assumption  f  means  that  X,  is  not  an  explicit  function  of  b(,  c(,  or  X,,  b, , 

Ci,  |*1,  .  .  .  ,n,  so  that  \  will  be  known  exactly  (i.e.,  will  be  a  known  number) 

when  the  jth  period  decision  is  made. 

As  a  consequence  of  this  assumption  it  can  be  seen  that  the  choice  of  X, 
affects  the  objective  function  of  problem  10  not  only  through  the  c[x,  term  but 
also  through  the  effect  of  X,  on  X, ,  j  2,  .  .  .  ,n.  X„  on  the  other  hand  enters  the 
functional  only  in  the  X„  term. 

It  will  now  be  shown  how  assumptions  e  and  f  enable  the  constraints  of 
problem  10  to  be  written  in  a  more  convenient  form.  The  P  operator  in  the 
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constraints  means  that  the  probability  is  computed  using  the  joint  distribution 
of  b,,  cj,i  ,j  =1,  .  .  .  ,n.  Thus,  by  using  a  Stieltjes  integral,  the  following  can 
be  written: 

P(b,  ,  A|1X,  *  ...  ♦  VV  *  HF(.) 

where  F(  .  )  is  the  joint  distribution  function  of  the  b, ,  c]ti ,  j =1 ,  .  .  .  ,n,  and  A 
is  the  set  where  b,  >  A,,  X,  *  .  .  .  *  A()X,. 

It  is  important  that  it  is  clearly  understood  what  is  meant  by  the  above 
identity.  This  is  most  easily  seen  by  writing  out  the  'th  of  the  equations 
(as  b,  is  m(  *  1 )  in  detail: 


P  (b, 


«f»  v 

fl,i  \ 


cf). 

3).  \ 


<p) 


V 


■  /.  / d  F ( b. , ,  .  b .  .  ,  b„ . . k._  .  c...  .  .c,.  .  .  .  ,c„ . cmm  ) 

'(f)  '  11  lt*t|  ttl  pi  m  ^  1  1  I  h  j  Hi  Fiflpi 

where  is  the  Pth  row  of  .4^  and  a^’  is  the  region  in  the  sample  space  of  the 
random  variables  involved  in  the  problem  over  whic!  b(p  -»  aj[>  X,  +  .  .  .  +  cjf’Xi 
+  .  .  .  +  Hence  the  compact  notation 


P<b,  i  A(|X,  ♦  .  .  ♦  A)(X()  ,  /  dF(  ) 

represents  m,  identities  of  the  type  shown. 

By  assumption  e,  F(  .  )  is  a  known  function.  Because  of  assumption  f,  the 
set  A  depends  only  on  blf  c,,i  1,  .  .  .  ,j  - 1 ,  and  b,  and  not  on  the  remaining  b, 
and  c | .  Consequently  the  integration  in  the  above  identities  can  be  performed 
by  integrating  first  with  respect  to  b)H,  .  .  .  ,b„,  and  cJt  ...  ,c„,  and  then  in¬ 
tegrating  with  respect  to  b, ,  .  .  .  ,b;,  and  c„  .  .  .  ,e,_,.  But  wh^n  integrated 
with  respect  to  b, ,  c,  ,i  =  j+l,  .  .  .  ,n,  and  c, ,  the  region  of  integration  is  over 
the  entire  range  of  possible  values  of  these  vectors  of  random  variables  (i.e., 
the  range  of  all  values  that  the  random  variables  can  assume  with  non-zero 
probability)  and  hence  the  resulting  value  of  the  integral  is  1.  Thus  "AdF(  .  )  = 
AdF((  . ), where  F,  (  .  )  is  the  joint  distribution  function  of  b, ,  .  .  .  ,  b,  and 
Cj ,  .  .  .  ,  c,_ ,  .  Moreover  F,  (  .  )  is  a  known  function  because  F(  .  )  is  known. 

Through  the  use  of  conditional  probabilities  ^dF  (  .  )  can  be  written  in 
terms  of  F(  (  .  ),  the  conditional  distribution  function  of  b,  given  b, ,  c „ »  - 1 , 

on  function  of  b, ,  c,,  i  -12  .  .  .  ,  )-l.  We 
F  (  .  ) )  d(ij  (  .  ).  Again  F  (  .  )  and  G(  (  .  ) 
a  from  a  knowledge  of  Ft  .  ). 

Now  a  ,|  =1,  ...  ,n  is  defined  as  an  m,  '  1  vector  of  probabilities  such 

that 


.  .  .  ,i-l,  and  G,  (  .  ),  the  joint  distribut 
have,  by  definition,  Ad Ff(  .  )  •  _^  \ 

are  known  because  they  can  be  comput 


dc  ( 


if  and  onh  if  /  d  F(  (  ) 


a 


I 


The  o,  will  be  conditional  probabilities  depending  on  the  given  (i.e.,  conditional) 
values  of  b, ,  c , ,  i  1,  .  .  .  ,)-l,  which  are  used  in  determining  F(  (  .  ).  In  other 
words,  Oj  is  such  that 
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if  and  only  if 


P(b;  -  Vvi  *  +WM 

where  the  P  operator  means  that  the  probability  is  computed  using  the  con¬ 
ditional  distribution  of  b,  given  b, ,  c„  i  1,  .  .  .  ,j-l,  i.e.,  using  F,  (  .  ).  Thus, 
for  example,  if  b,  is  independent  of  b„  c,,  i  =1,  .  .  . ,  | -1,  the  P  represents 
probability  using  the  marginal  distribution  of  b, . 

It  is  clear  from  the  above  development  that  if,  instead  of  having  P  and  or, 
as  in  the  constraints  of  problem  10,  a  similar  problem  is  given  whose  con¬ 
straints  involve  P  and  some  set  of  conditional  probabilities  5,,  j  =1,  .  .  .  ,n, 
the  corresponding  total  probabilities  P,  along  with  their  associated  or,,  can  be 
obtained  by  integrating  P  and  a,  with  respect  to  G,  (  .  ). 

It  is  important  to  note  in  this  development  that  o,  =  o,  because  F,  (  .  )  is 
the  marginal  distribution  function  of  b,  and  so  is  F|(  .  ).  In  other  words,  a, is 
a  vector  of  constants  and  its  components  do  not  depend  on  the  observed  values 
of  any  of  the  random  variables  involved  in  the  problem.  This  agrees  with  the 
interpretation  of  the  problem  in  this  paper  because  X,  must  be  selected  before 
any  of  the  random  variables  are  observed.  Consequently  the  first-period  con¬ 
straints  must  have  a  completely  deterministic  equivalent.  This  is  precisely 
the  type  of  interpretation  that  led  to  Ben-Israel's  theorem17  and  its  associated 
results  (see  Section  4).  In  brief,  X,  is  a  zero-order  decision  rule. 

The  question  of  how  to  determine  o,  for  given  or,  and  distribution  functions 
F,  (  .  )  and  G  (  .  )  is  extremely  difficult.  Here,  however,  determining  analytical 
properties  of  V  in  terms  of  is  of  chief  concern;  hence  the  problem  of  deter¬ 
mining  a,  will  be  left  for  a  subsequent  paper.  However,  it  must  be  emphasized 
that  a,  can  (and  in  general  will)  depend  on  the  given  values  of  b, ,  c, ,  i  ~1,  .  .  .  , 

I  - 1 .  This  is  of  particular  importance  because  it  greatly  enlarges  the  types  of 
constraints  that  are  permissible  in  such  a  model.  Elsewhere  in  the  literature 
it  is  always  assumed  that  the  probabilities  with  which  the  constraints  must  hold 
are  given  constants  no  matter  what  values  of  the  random  variables  are  observed. 
Thus  a  more  general  problem  is  being  treated  than  any  previously  considered 
in  the  literature. 

Relation  11  and  assumption  i  can  be  used  to  prove  the  following: 

Lemma  1.  The  constraint  P(  b,>  A,,X,  *  .  .  .  *  A„  X, )  >  o  in  problem  10 
can  be  replaced  by  the  equivalent  constraint 


where  F-‘  (l-o, )  is  an  m,  *  1  vector  of  the  l— o ,  percentile  (or  fractile)  points  of 
the  conditional  distribution  of  b,  given  b,  ,  c,  ,  1  *  i  ‘  |  •  The  ith  component  of 
F'1  (1-a,  ),  F_l  (l-o,t  )»  is  defined  by  F(~'  (l-a(|  )  5  maximum  ty:F,  (y)  r  1  - o, ,  i. 

The  use  of  a  maximum  in  the  definition  of  F,~ 1  (l-o,,  )  is  justified  because 
the  assumption  is  that  F,  (  .  )  is  left  continuous.*  The  reason  the  vector  in¬ 
equality  F,  (Y)  *  l-o,  does  not  necessarily  define  V  uniquely  is  due  to  the  fact 
that  some  of  the  random  variables  involved  in  F,  (  .  )  may  be  discrete  and  that 
some  of  the  components  of  l-o,  may  be  zero.  In  general  each  component,  y, , 

tFor  a  further  discussion  of  left  continuous  distribution  functions  see  Gnedenko, 
(Ref  24,  Ch  4). 
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of  Y  ran  take  on  any  value  in  an  interval.  If  this  interval  is  denoted  by  y,  (l-o,, ) 
and  y,  (1-0,1  ),  y,  (1— or ,, )  respectively  represent  the  smallest  and  largest  numbers 
in  the  interval  y.  (l-oM  ),  then  the  i  th  component  of  F~‘  (l-o  )  is  given  by 

y‘,U-V- 

It  is  assumed  throughout  this  discussion  that  F,  1  (1— or,, )  is  well  defined 
for  all  i,  j.  Thus,  for  example,  if  F,  (  .  )  is  the  distribution  function  of  a  normal 
random  variable  then  it  is  implicitly  assumed  that  no  component  of  a,  is  1,  so 
that  F  '*  (1-5  )  does  not  take  on  the  value  of  -  *>  for  any  i.  Similarly  it  can  be 
seen  tnat  o(l  cannot  be  zero  for  any  i  or  |,  since  in  this  case  l-o,,  =  1  and  so 
y,(l-u,,)  =  ♦  ®,  no  matter  what  distribution  F,  (  .  )  is  used.  This  is,  however, 
no  restriction  on  our  problem,  since  a,,  =  0  means  that  the  corresponding 
constraint  can  be  ignored  because  it  will  be  satisfied  for  any  choice  of  decision 
rules  X,,  .  .  .  ,  X,. 

Proof.  Assume  that  X, ,  1  <  t  *  |  are  decision  rules  that  satisfy 
P(  b,  >  AjjX,  +  .  .  .  ♦  A„  X, )  >  o,.  Then 

P(bj  i  A , ,  X ,  ♦  .  ♦  A  X,  )  a  a  (bv  relation  II), 


or 


1  -  P(b,  V', 


\,v  -  V 


or 


p(b,  .  a , , x ,  .  *  \,y  *  >-»- 


But  P  (b,  <  A(lX  ,  *  .  .  .  *  A ,, X f )  =  |j  d F,  (  .  )  where  I)  is  the  set  for  which 
b,  *  A, , X,  •*•...♦  A,,\,  =  Z.  Also,  by  assumptions  a  and  f,  A,lX1  +  .  .  .  +  A„X, 
is  not  a  function  of  b, ,  hence  Z  does  not  depend  on  the  still-to-be-observed  value 
of  b, .  Consequently  we  have 

/dF,<  )  .  f:d  F~<  I  -  f'o  -  f‘u,  X,  .  A„\  >. 

H 

Thus  the  |th  constraint  of  problem  10  requires  that 


But  any  distribution  function  is  a  nondecreasing  function  of  its  argument,  so 
that  F,  ( A, ,  X,  +  .  .  .  •*  A,,  X,  )  5  l-o,  implies 


Since  all  the  steps  of  the  proof  are  completely  reversible,  relation  12  implies 


Thus  the  lemma  is  proved. * 

Using  Lemma  1,  problem  10  is  written  in  the  following  equivalent  form: 


tNote  the  similarity  between  Lemma  1  and  Ben -Israel’s  theorem.17 
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maximize 


subject  to 


F  *  C  1  X  1  •*  f  J  X  2  4  f3^3 


V 


.  r.-'n-a-,. 


Aji  X,  ♦  Ajjj  Xj 

^31  X1  *  X  3  2  X  2  *  ^33  X3 


1  Fj-'U-Bj). 
•  Fj-'ll-a-.. 


(13) 


\l  X  1  4  ' n 2  X 2  4  Xn3  X3 


X,  -  "  I' 


The  E  operator  in  the  functional  of  problems  13  and  10  means  tnat  the 
expected  value  is  being  taken  with  respect  to  the  joint  distribution  of  b , ,  c, ,  i  , 

)  =1,  .  .  .  ,n .  By  using  the  linearity  of  the  objective  function  in  \(,  r  1 ,  .  .  .  ,n, 
the  following  may  be  written: 


E(  1  cTX  1 
1  =  1  1  ' 


!  /t/V F,  ) 


i  =  i  -« 


However,  this  expected-value  operation  can  be  simplified  by  making  use 
of  assumption  f.  Since  X(  is  a  function  of  \  ,  b(,  c,,  i  1,  .  .  .  j-1,  only, 


Eu,T.y  •  ?  f,TVF(  ■  >  *  7  </\ 


dFt<  l 


d  FJ(  ) 


-  7f/v v-’ 

—  as 

where  JF(  (  .  )  is  the  conditional  distribution  function  of  b,,  ...  ,b^,  c, ,cn, 
given  b, ,  .  .  .  ,b(_,  ,  c,,  .  .  .  ,  c,  ,  and  F,(  .  )  is  the  joint  distribution  function  of 

b  „  .  .  .  ,b(_, ,  c  . . cr  Thus.in  computing  E(c 1  \( ),  cT  \(  is  integrated  by  using 

only  F,  (  .  )  rather  than  F(  .  ).  11 


7.  MATRIX  THFORY 

In  the  following  discussion  of  the  optimal  decision  rules  use  is  made  of  a 
generalized  inverse  of  a  matrix.  The  properties  required  of  such  inverses 
are  listed  below.  Proofs  may  be  found  in  Charnes  and  Kirby*'  and  Rao.,fl 

Definition.  Let  A  be  any  m  *  n  matrix.  Then  any  n  >  m  matrix  A*  such 
that  AA8  A  A  is  called  a  generalized  inverse  of  A. 

It  can  be  shown  that  such  an  inverse  always  exists  and  is  unique  if  and 
only  if  m  n  . 

Lemma  2.  Let  A  be  an  m  »  n  matrix.  Then  there  exists  an  A"  such  that 
AA"  I  if  and  only  if  rank  (  A)  m,  i.e.,  A  has  a  right  inverse  if  and  only  if  it 
has  full  row  rank. 

Lemma  3.  The  matrix  equation  A\  b  is  consistent  if  and  only  if  AA8  b  b. 

Lemma  4.  Let  the  matrix  equation  A\  ~  b  be  consistent.  Then  the  general 
solution  is  given  by  \  -  A"b  +(|-  A* A  )Y  where  V  is  an  arbitrary  n  »  1  vector  and 
I  is  the  n  >  n  identity  matrix. 
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Using  these  lemmas,  a  necessary  and  sufficient  condition  can  be  given 
for  the  existence  of  a  vector  V  satisfying  the  set  of  inequalities  AX  ^  b,  \  -  0. 
For,  there  exists  \  0  such  that  AX  •  b. 

“  there  exists  X,  S,  '  0  such  that  AX  +  S  b  where  S  is  an  n  ■  1  vector 
“there  exists  X,  S,  ■*  0  such  that  AX  b-S,  AA"  (b-S)  b-Sby  using 
Lemma  3. 

“there  exists  an  arbitrary  vector  Y  and  a  vector  S  ■*  0  such  that 

\ a" <  b -s )  -  b-S 
A*  I  b-S)  2  (  A*  A-l)  N 

by  using  Lemma  4.  In  this  case  X  -  A*  (b-  Si  +  ( I-  A"  A )  Y .  Therefore  there 
exists  a  finite  n  »  1  vector  X  satisfying  the  set  of  inequalities  AX  *  b,  X  •»  0 
if  and  only  if  there  exists  a  finite  n  »  1  vector  Y  and  n  »  1  vector  S,  S  0  such 
that  AA*  (b-  S)  b-  S  and  A*  (b-  S)  %  (  A* A -I )  Y.  From  Lemma  2  it  can  be  seen 
that  if  A  has  full  row  rank,  the  condition  AA*  (b-S)  -  b-S  becomes  an  identity 
and  it  is  only  required  that  there  exist  vectors  Y  and  S  ^  0  such  that  A*(b-  S) 

>  (A* A-l)  Y. 


8.  REMARKS  ABOIHT  THE  CONSISTENCY  OF  THE  CONSTRAINTS 

A  significant  property  of  chance-constrained  programming  models  is  that 
the  constraints  generally  represent  the  “intentions"  of  management  rather  than 
hard  and  fast  “rules."  In  other  words,  the  constraints  rep-esent  “bounds"  in¬ 
side  which  management  would  like  to  operate  “most  of  the  time*  rather  than 
"all  the  time.*  The  decision  rules  resuh'ng  from  solving  a  chance-constrained 
problem  are  designed  to  give  guidelines  rather  than  definite  plans  of  action  for 
management  operation  and  decision,  subject  to  qualifications  surrounding  the 
controls  involved  in  implementing  the  rules.  Thus  the  optimal  decision  rules 
for  our  problem  may  impute  an  action  that,  because  of  exceptional  circumstances, 
cannot  actually  be  taken. 

Another  major  feature  of  the  chance-constrained  programming  concept 
is  that,  in  contrast  to  so-called  “linear  programming  under  uncertaintv,"  the 
constraints  (e.g.,  1"  a1(i(  b,  ,  as  in  relation  3)  need  not  hold  with  probability 
1.  This  generalization  is  important  on  several  realistic  grounds.  First,  it 
may  not  be  possible,  or  even  desirable,  to  specify  actions  in  every  conceivable 
circumstance,  i.e.,  for  every  possible  value  of  the  random  variables.  For  ex¬ 
ample,  in  the  heating-oil  problem  (see  Section  2),  certain  combinations  of 
events  (which  really  did  occur  once  or  twice)  were  impossible  for  the  company, 
acting  alone,  to  handle  either  physically  or  economically.  Second,  constraints 
that  have  the  nature  of  “policies*  are  not  really  intended  to  apply  in  every  in¬ 
stance  but  only  “almost  all  the  time.*  Hence  the  constraints  of  the  problem  in 
Section  2  say  that  the  supply  of  heating  oil  must  exceed  the  customer  demand, 
not  all  the  time  but  100 a,  percent  of  the  time,  where  a,  is  some  number  close 
to  but  not  equal  to  1.  Third,  oy  their  nature,  policies  (as  opposed  to  definite 
rules)  need  not  spell  out  in  advance  the  actions  that  will  be  taken  in  exceptional 
circumstances. 
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Chance-constrained  programming  allows  for  such  exceptional  circum¬ 
stances  in  a  very  easy,  natural  manner,  particularly  when  the  admissible 
classes  are  defined  in  sime  straightforward  analytical  manner,  e.g.,  as  linear 
decision  rules.  However,  questions  arise  when  one  wishes  to  consider  “all 
possible"  or  “all  possibly  desirable*  stochastic  actions  in  time  sequence.  It 
may  then  be  seen,  on  reflection,  that  an  essential  feature  of  the  analytically 
specified  admissible  class  is  that  the  analytic  description  carries  one  through 
sets  of  random  events  that  imply  constraint  violations  or  the  impossibility  of 
actual,  as  opposed  to  imputed,  further  actions  in  the  time  sequence.  For  ex¬ 
ample,  in  the  heating-oil  problem  the  linear  decision  rules  defined  a  value  of 
R(  no  matter  how  large  the  values  observed  for  S, ,  i  =  1,  .  .  .  ,|—  1 .  Clearly 
then,  if  extremely  large  demands  were  observed  in  periods  1,  .  .  .  ,j-l,  it  would 
be  physically  impossible  for  the  company  to  produce  enough  oil  to  meet  the 
amount  given  by  R(.  This  would  represent  an  exceptional  circumstance  and  the 
company  would  not  (and  could  not)  follow  the  decision  for  period  j  imputed  by 
the  linear  decision  rule.  Thus  it  is  important  to  consider  the  question  of  what 
analytic  specifications  should  be  made  to  carry  one  through  such  situations  in 
solving  for  optimal  rules. 

In  addition  this  example  shows  that  if  constraints  that  place  an  upper 
bound  on  R. ,  |  -  1,  .  .  .  ,n  are  added  to  problem  4  it  is  possible  that  the  con¬ 
straints  will  be  inconsistent  for  some  values  of  S, ,  |  ~  1,  .  .  .  ,n.  Therefore 
discussion  of  the  difficulties  involved  when  the  constraints  of  the  problem  are 
inconsistent  must  precede  the  establishment  of  certain  properties  of  the  decision 
rules  that  are  optimal  for  this  problem. 

The  question  of  inconsistency  is  of  significance  because,  as  has  been 
illustrated,  one  of  the  major  (and  frequently  overlooked  and  misunderstood) 
properties  of  chance-constrained  models  is  that  points  for  which  the  constraints 
are  inconsistent  may  well  exist  in  the  sample  space  of  the  random  variables 
involved  in  the  problem.  In  other  words  there  may  be  points  (b,,  .  .  .  ,  bw 
clt  .  .  .  ,c„)  in  the  sample  space  of  the  random  variables  b, ,  c , ,i  ,  |  =  1,  .  .  .  ,n, 
for  which  there  do  not  exist  decision  rules  X,,  j  -  1,  .  .  .  ,n  that  satisfy  the 
constraints  of  problem  13.  Thus  it  is  possible  that  for  some  values  of  the  ran¬ 
dom  variables  b , ,  c , ,  i  1,  .  .  .  j  —  1 ,  and  first  |-1  decisions,  X,,  i  =  1,  .  .  .  ,j—  1 , 
there  will  not  exist  an  n,'  1  vector  X,  '*0  satisfying  A„  \,  *  F'^l-a,)-  An_,  X,-, 

-.  .  .-A,,V,.  In  such  cases  the  question  arises  as  to  the  meaning  of  the  expec¬ 
tation  operator  in  the  functional.  How  is  E(  cj  X^)  calculated  over  those  sample 
points  for  which  no  feasible  X.  exists? 

One  method  of  resolving  this  difficulty  is  to  replace  1"  ,  E  ( cJX.)  in  the 
functional  bv  l"  1  E(  (  cJX (),  where  E,  means  that  the  expected  value  is  taken  over 
only  those  sample  points  for  which  there  exist  decision  rules  X, ,  .  .  .  ,  X, , 
satisfying  the  first  |  constraints  of  problem  13.  Thus 

*STV  ■ 

—  OK 

but 

E,“,TV  • 

where  A,  denotes  the  region  where  the  first  j  constraints  of  problem  13  are 
consistent. 
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If  A,  denotes  the  complement  of  A,,  it  can  be  seen  that  E(cJX^)  and  E^cjXj) 
are  related  by 

E,<c,TV  -  E{f}TX,)  -  _/  cj\td  F,(  •  )  . 


\ 


By  applying  the  mean  value  theorem  we  obtain  the  equation 


EicJ  V()  -  Ef(cJ  X,)  -  kP(  A,) 


where  k  is  a  constant  and 

P(  \,i  -  /  d F,(  •  >  ■ 

\ 

Thus  we  see  that  whenever  P(A,)  is  small,  the  use  of  E,  provides  a  close  ap¬ 
proximation  to  E. 

The  concept  of  an  analytic  decision- rule  class  under  chance  constraints 
originated  by  Charnes  and  Cooper 2‘ l0,JT’ 2®  provides  a  direct  and  simple  means 
of  specifying  the  decision  rules  across  the  inconsistent  points  of  the  problem. 

In  this  case  the  analytic  description  of  the  class  permits  X,  to  be  defined 
throughout  all  points  of  the  sample  space  of  the  random  variables  (see  the  pre¬ 
ceding  discussion  and  the  example  of  Section  2).  Therefore  when  a  class  of 
decision  rules  is  specified,  X,  becomes  defined  throughout  A,  and  the  use  of 
the  E  operator  again  becomes  meaningful. 

One  way  of  defining  X,  in  A,  is  to  specify  X,  -  0  in  this  region.  The  effect 
of  this  definition  is  to  make  E,  (cJ.X,)  =  E(cTX,).  Moreover  use  can  be  made  of 
the  material  developed  in  Section  7  to  specJy,  a  priori,  the  region  A,  over 
which  X,  =  0  will  be  put.  It  is  known  from  Section  7  that  there  exists  a  decision 
rule  X,  0  satisfying  A()X,  <  fJ'Kl-or, )  -  A,,.,  X(_,  -  .  .  .  -  A, ,  X,  if  and  only  if 
there  exist  vectors  Y,  and  S,  0  satisfying 


and 


A  A  *<  F_l 
M  II 

( l -0|) 

-  V. 

-  \,V 

1  -  V 

■  r 

‘(1-0,1 

-  A„-. 

>v.  - 

- 

.  -  V 

v.\ 

-1  *  • 

- 

-  S  I  i 

1 

•  V  v'»\ 

this 

region 

\ 

-  V,-. 

V.  - 

'  - 

-  V 

‘  '|-a,;aI(.  v 

i 


If  D,  denotes  the  region  where  a  feasible  X,  exists,  it  can  be  seen  that  A,,  the 
set  of  points  for  which  the  first  |  constraints  are  consistent,  is  given  by  \  ,  0, 
Hence  A,,  the  complement  of  A|t  equals  ,  1), ,  where  I),  is  the  complement  of 

'V 

The  specification  of  X,  as  constant  (  0)  over  its  region  of  inconsistency 
makes  it  possible  to  extend  the  results  obtained  below.  In  Theorem  2  it  will 
be  proved  that  in  the  region  where  the  constraints  are  consistent  the  optimal 
decision  rule  for  X,  gives  \(  as  a  piecewise  linear  function  of  F^'U-a, ),  and 
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XM,  .  .  .  ,  Xi .  Therefore  specifying  X,  =  0  in  A,  means  that  the  optimal  de¬ 
cision  rules  become  piecewise  linear  everywhere. 

Another  method  of  resolving  the  difficulty  of  inconsistency  that  could  yield 
values  of  X,  /  0  in  A,  is  to  modify  the  formulation  of  the  constraints  of  the  prob 
lem  so  that  the  right-hand  side  of  the  constraints  in  problem  13  is  operated  on 
by  a  projection  into  the  range  of  the  A„  operator.  This  projection  would  be  the 
identity  operator  when  consistency  holds  and  would  yield  some  near  point  in 
the  range  of  A„  when  the  |th  constraint  is  inconsistent  (if  the  Penrose- Moore 
inverse29  were  used  in  the  projection  operator,  the  near  point  would  be  the 
nearest  feasible  point  in  the  Euclidean  sense  ).t  With  this  change  in  the  con¬ 
straints,  the  E  operator  could  then  be  applied  without  modification  and  the  re¬ 
sulting  functional  value  of  E (cj  X()  would  differ  from  E(  ( X( )  by,  at  most,  some¬ 
thing  on  the  order  of  P(A  ). 

Regardless  of  the  procedure  used  to  resolve  the  difficulty  of  inconsistent 
sample  points,  the  problem  of  determining  the  optimal  decision  rule  for  X,  is 
reduced  to  finding  the  optimal  rule  when  the  constraints  are  consistent.  At 
those  sample  points  where  sufficient  consistency  exists  for  the  constraints  and 
the  functional  to  be  meaningful,  the  argument  of  the  following  section  is  valid. 


9.  OPTIMAL  CLASS  OF  DECISION  RULES 

Let  (b, , .  .  .  ,  b„,  c, ,  .  .  .  ,c„)  be  any  point  in  the  sample  space  of  the  vectors 
of  random  variables  b, , c, ,i ,  |  =  1,  .  .  .  ,n,for  which  there  is  consistency,  i.e., 
let  this  sample  point  be  in  ^,D,.  Suppose  that  for  each  such  consistent  point 
the  decision  rults  are  found  to  be  X*,  .  .  .  ,  XJ,  maximizing  L ”  (  «. T X,  subject  to 
the  constraints  of  problem  13.  Then  X*.  |  -  1,  .  .  .,n,are  the  optimal  decision 
rules  for  the  problem.  This  follows  from  the  fact  that  for  any  other  feasible 
decision  rules,  X, ,  j  =  1,  .  .  .  ,n,  for  problem  13,  I"  ,  cj  X(  *  -»  I"  c  T  X.  results 
from  the  definition  of  X*,  |1,  .  .  .  ,n.  This  implies  that  Ed^y  X(*) 

E(I"  ,c(r  X(),  thus  establishing  the  optimally  of  X*,  i  =  1,.  .  .  ,n,for  points 
where  consistency  is  present. 

Theorem  2 

The  optimal  decision  rule  for  X,  in  problem  13  is  of  the  form 

\  (  •  a  pin  fHise  linear  fun  ( I  ion  of  I  hr  (  1  •o|  •  and  \  f  \j 

Proof.  By  assumption  f,  X,  ,  |  1,  .  .  .  ,n,  influences  the  objective  function 

through  the  terms  cj  X,,  c* ,  XM1  ,  .  .  .  Hence  the  effect  of  the  choice  of 

\„  appears  in  ‘he  functional  only  in  the  c([  V  term.  Consequently  the  first  step 

is  to  find  the  optimal  decision  rule  for  Vi  as  a  function  of  X^  . X,  and 

then  proceed  recursively  by  finding  the  optimal  X„_ ,  as  a  function  of  X,,_2,.  .  .,  X, 
until  the  problem  is  ultimately  reduced  to  finding  the  optimal  X,. 


♦  See  Ben-lsrael  and  Charnes30  and  Other  references  on  Penrose-Moore  inverses 
given  in  Section  J. 
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To  avoid  the  detailed  computation  for  each  stage  the  theorem  will  be 
proved  by  induction  on  i  where  j  =  n  *  1  -  i.  After  proving  the  theorem  for 
l  =  1,  i.e.,  after  proving  that  the  optimal  \  is  given  as  a  piecewise  linear  func¬ 
tion  of  F~‘(l-orn),  . \v  it  is  then  assumed  for  induction  that  the  theorem 

is  true  for  t  =  k ,  i.e.,  for  Vii*i  .  •  •  •  .  V  It  is  then  proved  true  for  l  =  It  +  1, 
i.e.,  for 

Let  (b„  .  .  .  ,b„,  . . cj  be  any  consistent  point  in  the  space  of  possible 

values  for  the  vectors  of  random  variables,  b, ,  c,  ,i  ,  j  =  1,  .  .  .  ,n.  For  this 
sample  point  the  problem  of  maximizing  I "  cTX.  subject  to  the  constraints  of 
problem  13  will  be  solved  and  thus  the  optimal  decision  rules  Xj*,  j  =  1.  .  .  .  ,n, 
will  be  obtained. 

Let  X,,  .  .  .  ,X„_,  be  specific  values  for  the  first  n-1  decision  rules.  Then 
our  problem  becomes  one  of  determining  a  decision  rule  for  \  that  solves 

maximize 
subject  to 


2  o 

But  because  \(,  |  =  1 . n-1  is  specified  and  the  focus  is  on  a  single  point 

in  the  sample  space,  the  right-hand  side  of  the  constraints  in  problem  14  is  a 
known  vector.  Moreover  by  assumption  a  all  A,,  are  constant  matrices;  hence 
problem  14  is  a  linear  programming  problem  and  so  has  a  dual.  Its  dual  is 


dx. 


*nn\  - 


(14) 


minimize 
subject  to 


-  Anl  Xl] 


r.  ,  ,t 

Jit  ^  n  ' 


1  0  . 


(15) 


If  problem  14  has  a  finite  optimal  solution  X£,  then  the  convex  set  of 
feasib’e  solutions  to  problem  15  has  a  finite  number  of  extreme  points,  denoted 
by  u.*,  i=l,...  ,\n;  and,  from  the  dual  theorem  of  linear  programming,  it  is 
concluded  that 


V* 


I 


min 


1. 


1  -a 


) 

n 


A  tin  —  1  A  n  —  I 


(16) 


Since  there  is  only  a  finite  number  \„  of  extreme  points,  problem  16  implies 
that  \„*  is  a  pie(  wise  linear  function  of  F“‘  (l-**,,)  -  A^,,  .  .-  A^,  X,  with, 

at  most,  \„  pieces.  Hence  Yf,  is  piecewise  linear  in  Fn_,(l-on)  and  \w_1,  .  .  . , X, . 
Thus  the  theorem  is  true  for  <  1. 

Now  assume  for  induction  that  the  theorem  is  true  for  t  k.  Then  there 
exist  functions  Hj  WF“ 1  (1— or( ),  \f_ ,  ,  .  .  .  ,  \,.|  =n-k  +  l,  .  .  .  ,n,  which  are  piecewise 
linear  in  F_l (1— or( ),  \(_,  ....  , \, ,  and  are  such  that  \*  H(  ,|  «-k*l,  .  .  .  ,n. 

Thus  problem  13  can  be  written  as 
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maximize 

E,C!  X1  *  ♦  ‘ 

subject  to 

'11 ' i  '  r  r1  *  1  -« i » . 

-  F-‘n-a:l. 


cl  HJ 


(  17) 


‘.-o': 


i  r; 


,  *  1  -  a  . 


V  -0,  |  ■  1.  ....  n  •k 

Again  let  (b  ,  .  .  .  ,b„,  c, ,  .  .  .  ,cn )  be  a  particular  sample  point  and  let 
\  ,  .  .  .  f  -V, ,  be  specified  values  for  the  first  n-k-1  decision  rules.  Then 
\*_k  is  obtained  by  solving  the  following: 


maximize 
subject  to 


c]  .X,,  ♦  I  C  H, 

n~*  n~*  |  “  n  —  k*  I  ’  1 


'  n  -*  -  Fn  -*  1  ~an  ~k  *  ~  -I  '  *  ~k -I 


‘  (18) 


\ 


n  —to 


2  o  . 


The  functional  in  problem  18  is  piecewise  linear  in  \,_fc  because  of  the 
piecewise  linearity  of  the  H|fj  =n-k*l,  .  .  .  Moreover  the  optimal  Vi_k  can 
be  obtained  by  first  solving  problem  18  over  each  piece  where  the  functional 
is  linear  in  \*_k  and  then  selecting  the  piece  that  is  optimal.  That  is,  the  solu¬ 
tion  space  of  problem  18  can  be  divided  into  sets  K , ,  .  .  .  ,ks,  such  that  in  any 
set  kr  the  functional  in  problem  18  is  linear  in  \„_k.  Problem  18  is  then  solved 
for  \n*h  r  the  optimal  \,.k  in  kr,r=  1,.  .  .  ,\,  and  finally  \£_k  is  selected  such 
that 


n 

i 


■VV 


»W 


r-1 . N 


Since  for  any  set  kr  the  functional  in  problem  18  is  linear  in  X„_k,  the 
problem  in  kr  is 


maximize 

subject  to 
A. 


r  ■'  l  1  -a..  I  -  A 


n  — to  ,  n  — to  - 1  n  - 1 


2  0 


(19) 


where  i' J_fc  is  a  1  *  nn  k  matrix  whose  components  depend  on  »he  sample  [joint 
( b,,  .  .  .  ,b„,  c ,,  .  .  .  ,c„)  and  the  given  values  of  \lt  .  .  .  , 

Because  problem  19  is  a  linear  programming  problem  like  problem  14, 
by  writing  its  dual  and  proceeding  exactly  as  was  done  in  going  from  problem 
14  to  problem  16,  it  can  be  seen  that  X*^  is  a  piecewise  linear  function  of 
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F-j^d-tt^,  X,,-*-,,  •  •  •  , X,.  Since  this  is  true  for  each  region  Kr  over  which 
the  functional  in  problem  18  is  linear,  it  is  concluded  that  is  indeed  piece- 
wise  linear  in  F^fl-o^),  Vk-i.  •  •  .  X,. 

Thus,  by  induction,  the  theorem  is  proved. 

Corollary  1.  The  optimal  decision  rule  for  \,  ,|  _  1,  .  .  .  ,n,  in  problem  13 
is  of  the  form  \,  -  a  piecewise  linear  function  oi  the  ^"'(l-a,),  i  1,...,|. 

10.  OPTIMAL  DECISION  RULES  FOR  SOME  SPECIAL  CASES 


The  results  of  the  preceding  section  apply  to  any  model  satisfying  assump¬ 
tions  a  through  f.  These  assumptions  are,  however,  very  general  and  so  it  is 
instructive  to  look  in  closer  detail  at  certain  special  cases  that  yield  particu¬ 
larly  interesting  results.  More  specifically  the  chief  concern  will  be  the  case 
in  which  problem  10  is  an  n -stage  linear-programming-under-uncertainty 
problem  (i.e.,  or,p  -  1,  l  1  ,  .  .  .  ,m, ,  i  =  1,  .  .  .  ,n  ). 

However,  another  important  special  case  of  problem  10  is  considered  in 
Ref  5.  This  is  the  case  in  which  each  A)(  matrix,  i  >)  ,i  ,  |  -  1,  .  .  .  ,n,  is  a 
1  ■  1  matrix.  In  other  words,  each  period  generates  exactly  one  new  constraint 
(instead  of  m,  new  constraints  as  in  problem  10).  Charnts  and  Kirby'1  have 
shown  that  there  is  much  more  to  say  about  the  solution  of  such  problems  than 
has  been  said  in  Theorem  2.  In  particular  an  explicit  method  of  obtaining  the 
optimal  decision  rules  under  certain  circumstances  is  shown. 

To  begin  with,  however,  it  is  observed  that  Corollary  1  also  implies  the 
following. 

Corollary  2.  The  optimal  decision  rule  for  \,  ,j  -  1,  .  .  .  ,n,  in  problem  13 
is  piecewise  linear  in  the  b, ,  c, ,  i  - 1,  .  .  .  , j  —  1 ,  if  and  only  if  Fp~'(l-ap ),  f  =  1, 

...  ,1,  is  piecewise  linear  in  blf  c, ,  i  1,  1. 

As  an  example  of  a  situation  in  which  Corollary  2  is  applicable,  consider 
me  case  where  b , ,  .  .  .  ,b„,  c„  .  .  .  ,cn.  are  jointly  normally  dis'ributed^nd 
Of  is  a  constant  for  all  j,  t ,  and  so  does  not  depend  on  the  given  values  of 
b , ,  c ,,  i  -  1, .  .  .  ,i  1.  Then  F(  (  .  )  is  the  distribution  function  of  a  normal  random 
variable  with  mean  m  and  variance  6*.  Moreover  m  is  a  linear  function  of  the 
random  variables  that  are  given  in  the  conditional  distribution.  Thus  m  is  a 
linear  function  of  b,,  c, ,  i  1,...,|-1.  Also  6*  is  a  constant.  Therefore 
F(  (yp )  ♦  (  yp-m/  6  )  can  be  written  where  4>(  .  )  is  the  distribution  function  of  a 

normal  random  variable  with  mean  0  and  variance  1.  Thus 

r ,  (  \  p  i  •  l-tjjp  •  \p  »  r  j  1  (  1  —  a  !  p '  -  f>  •t’  ~ 1  (  1  -  a  !  p  •  »  m  . 

But  Ojf  is,  by  hypothesis,  a  constant.  Hence  ♦_1(l-o|p )  is  a  constant,  and  the 
linearity  of  m  implies  the  linearit”  of  F'Kl-o  p).  Therefore  each  component 
of  F-'d-o, ),  i  1,  ...  ,n,  is  a  linear  function  of  b, ,  c, ,  i  1,  .  .  .  ,|  -1. 

To  give  examples  where  the  fractile  points  are  linear  in  the  given  sample 
points  and  the  are  not  constant  is  fairly  difficult.  In  fact  it  is  clear  that  \* 
will  not,  in  general,  be  piecewise  linear  in  b, ,  i , ,  i  1 . 1  -  1 . 

Another  special  case  of  this  corollary  falls  into  the  category  of  linear 
programming  under  uncertainty.*  Suppose  that  o,  1,  |  1,  .  .  .  ,n.  Then  by 

t  See  Charnes,  Cooper  and  Thompson’1  and  G.  B.  Dantzig  et  al.^’ 31-54 
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our  definition  of  a,  there  must  also  be  o,  1,  )  =  1,  .  .  .  ,n.  Assume  also  that 
the  conditional  distribution  of  b , ,  |  1,  .  .  .  ,n,  given  b,,  c ,  ,|  1,  .  .  .  ,|  -1,  is 
such  that  its  Oth  fractile  point  does  not  depend  on  the  given  values  of  b, ,  c,, 

i  1 . (-1.  Then  Fj^'d-Oj )  F*'(0),  |  =1,  ...  ,n,  becomes  a  const;  nt  vector 

the  components  of  which  are  known  before  any  observations  have  been  made. 

The  solutions  space  of  problem  13  is  therefore  a  fixed  convex  set  wlose  bound¬ 
ing  hyperplanes  do  not  depend  on  the  random  variables  involved  in  the  problem 
Thus  we  are  led  to  the  following: 

Theorem  3 

Assume  a  1 ,  |  =  1 ,  .  .  .  ,n. 

Assume  Fj'KO),  j=l,  .  .  .  ,n,  is  a  vector  of  constants. 

Then  the  optimal  decision  rule  for  in  problem  13  is  such  that  \(  is 
piecewise  constant. 

Proof.  By  Corollary  1  the  optimal  is  piecewise  linear  in  Ff 1  ( 1  -o’, ). 
i  1.  .  .  .  ,|.  Therefore  X,  is  piecewise  linear  in  the  F7‘(0),  i  1,  .  .  .  ,|-1. 

But  F7 1  (0 )  is  a  constant  by  assumption.  Hence,  \,  is  piecewise  constant. 

This  result  was  also  obtained  in  a  different  manner  by  Charnes,  Cooper 
and  Thompson21  for  the  n-stage  linear-programming- under- uncertaintv  model 
(see  Theorem  1). 

The  assumption  used  above  that  F^MO)  be  a  constant  vector  is  not  as 
restrictive  as  it  might  appear  at  first  glance.  Indeed  this  assumption  is  true 
for  a  very  large  class  of  distribution  functions  F(  (  .  ).  In  fact  in  Theorem  3 
only  such  distributions  are  eliminated  as  F,  (  .  ),  the  distribution  function  of  a 
random  variable  that  is  uniform  over  ( a,  b  )  where  a  depends  on  b(,  c|f  1  -  1, .  .  ., 

I  -1.  However,  if  a  is  a  constant,  b  can  depend  on  b(,  c,,i  1,  .  .  .  ,|-1,  and 
F,  (  .  )  will  still  satisfy  the  requirement  that  Ff‘(0)  be  a  constant. 

This  discussion  also  illustrates  one  of  the  major  weaknesses  of  linear 
programming  under  uncertainty,  i.e.,  that  for  many  distributions  the  constraints 
of  problem  13  are  inconsistent  everywhere.  Suppose  again  that  b,,  c,,i  =  1, 

.  .  .  ,n,  are  jointly  normally  distributed.  Then  F;  (  .  )  is  the  distribution  function 
of  a  normal  random  variable,  hence  F^'fO)  -  *\  Consequently  there  do  not 

exist  decision  rules  \(  ,  p  1 . n,  that  satisfy  the  constraints  of  problem  13, 

no  matter  what  sample  points  are  observed. 

Further  results  are  obtained  on  the  linear-programming-under-uncertaintv 
problem  by  restricting  the  above  model  still  further.  To  do  this  it  is  assumed 

that  c , ,  |  -  1 . n  is  a  vector  of  constants  rather  than  random  variables  and 

then  problem  13  is  shown  to  reduce  to  an  ordinary  linear  programming  problem. 
When  c,,  i  1,  .  .  .,n,  are  constants,  the  E  operator  means  that  the  expected 

value  of  L"  ,  c*\,is  taken,  using  the  joint  distribution  of  b . . b„.  Therefore 

if  problem  13  is  solved  for  a  given  sample  point  ( b . . b„),  the  optimal  de¬ 
cision  rules  \*,|  1,  .  .  .  ,n  will  be  obtained  for  this  particular  .‘ample  point. 

But  a,  1,|  1,  .  .  .  ,n,  and  by  assumption  F(~‘(0)  is  a  vector  of  constants;  hence 

the  constraint  set  of  problem  13  does  not  depend  on  the  sample  point  ( b, . Ni). 

Moreover  since  the  c,  are  constant,  problem  13  is  now  completely  deterministic. 
Thus  \^,  1  1,  ...  ,n  will  tie  optimal  for  all  sample  points  and  these  optimal 
decision  rules  can  be  found  by  solving  the  ordinary  linear  programming 
problem 
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maximize 

,v:\ 

subject  to 

A i ,  \ ,  <  Ff'toi . 

a2,v,  *  a22\2  <  F-'«o.  . 


\  2  0.  1-1.  ...  .n 

Another  example  of  a  situation  in  which  the  solutions  space  of  problem  13 
does  not  depend  on  the  observed  values  of  b, ,  c,,  i  1,  .  .  .  ,n  can  be  obtained 
by  supposing  that  bj .  j  1,  ...  ,n  is  stochastically  independent  of  b, ,  c,,  i  1, 

.  .  .  , j  —  1  and  that  o,  ,  |  1,  .  .  .  ,n  is  a  constant  vector.  Then  F7‘(l-a(  )  is  a 

constant  vector,  since  it  represents  the  1-a,  th  fractile  point  of  the  marginal 
distribution  of  b,  .  But  by  assumption  e  this  marginal  distribution  is  known, 
so  F-'fl-o  )  can  be  computed  before  taking  any  observations.  Thus  the  solutions 
space  of  problem  13  does  not  depend  on  the  observed  random  variables. 

Even  when  b,  is  dependent  on  the  b,,  c,,  i  1,  .  .  .  ,| -1,  Ff 1  (l-o,)  is  a 
vector  of  constants  because  it  represents  the  m,  •  1  vector  of  1-a,  percentile 
points  of  the  marginal  distribution  of  b„and  a,  or,  is  constant. 

It  is  interesting  to  note  that  if  £, ,  |  1,  ...  ,n  is  a  constant  vector,  if 

b,.  |  1,  .  .  .  ,n  is  independent  of  the  joint  distribution  of  b , ,  i  1,  .  .  .  ,|  l.and  if 

a,,  I  1,  ...  ,n  is  constant,  then, by  an  argument  analogous  to  that  used  above, 
it  can  be  concluded  that  problem  13  becomes  an  ordinary  linear  programming 
problem. 


11.  EXAMPLES 

To  illustrate  the  results  obtained  two  examples  are  presented.  In  the 
first  example,  problem  20,  the  constraints  are  consistent  for  all  sample  points 
and  all  values  of  aj  in  ,0,  1  In  the  second  case,  problem  23,  it  can  be  seen 
how  a  well-posed  problem  can  have  sample  points  for  which  the  constraints 
are  inconsistent. 

In  both  these  examples  the  various  a,  will  not  be  explicitly  written  as 
functions  of  b( ,  r( ,  i  =1 , .  .  .  ,j- 1.  Thus  the  problem  will  be  solved  treating  as  an 
arbitrary  function  of  its  conditional  random  variables,  subject  only  to  the  re¬ 
striction  that  0  r  Qtf  ’  1  for  all  |  and  f.  In  addition  it  will  be  assumed  that  our 
problems  are  given  in  the  form  of  problem  13  rather  than  problem  10,  so  that 
the  constraints  involve  P  and  o,  and  not  P  and  o(. 

Example  1 

As  our  first  example  the  following  two-stage  problem  in  which  \,  x , , 

and  Xj  x2  are  1  ■  1  vectors  (i.e.,  i,,i2  are  single  variables)  will  be  solved: 


26 


maximize 

fu..i  ♦  fj«2' 

subject  to 

f’(  h,  1  »,)  .1  a  ,  . 

I’ I  b2  ^  l,  ♦  t2\  2  a  i  •  (20) 


where  c,  is  uniformly  distributed  over  .15,  25,  and  c,  is  uniformly  distributed 
over  .  c  |  -  6,  c,  *  4  ].  Thus  the  mean  of  c  2,  given  c ,,  is  c,  - 1 ;  but  c like  c(, 
has  a  range  of  ten  units,  b,  is  uniformly  distributed  over  ’  100,  200'  and  b2  is 
uniformly  distributed  over  75,  2b,  -  75  .  Thus  the  mean  of  b2,  giver  b,.  equals 
b, ,  but  the  range  of  b ,  is  not  the  same  as  the  range  of  b, . 

In  addition  it  is  assumed  that  c,  and  c,  are  independent  of  b,  and  b:.  The 
solution  of  problem  20  is  begun  by  computing  and  Fd(l-o2). 

Let  v  be  a  random  variable  that  is  uniformly  distributed  over  a,  b  . . 

Let  y  be  the  l-o th  fractile  point  of  y .  Then  "v(l/b  -a)  1  -  o.  Hence 

y  (b-  a)  (l-o )  +  a.  Thus  the  equivalent  form  of  problem  20  is: 


maximize 
subject  to 


EU,i, 


r , « ,  > 


i  ,  *_  100  (1-0,1  .  100 

I  ,  ♦  r  j  1  I  2b ,  -  1 V)  >  <  1  -  a  2  >  ♦  "*>  . 


(21) 


Or,  equivalently, 
maximize 

subject  to 


ru,i| 


\ 


I  ,  i  200  -  100  o  J  . 


I,  *  «J  *_  21  l-«j'b, 


1  V)  a  2  -  ?■>  . 


*1  *2  -  0 


Let  (b, ,  b2,  c , ,  c  2)  be  any  sample  point  and  let  x,  be  given.  Then  to  deter¬ 
mine  i*  the  following  must  be  solved: 

maximize 

c j  ij 

subject  to 

Xj  i  2<  1  -a  j '  b,  ♦  IV)  a 2  -  TV  -  i,  . 

*2  -  0 
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But  i  0  always,  so  that  t  *  2(1-0.)  b,  *  150o:  —  75  —  * , .  To  find  the  optimal 

i,  the  following  must  now  he  solved: 

maximize 

r 1  ‘  i »,  *  «*  ) 

subject  to 

t  -  1001  I  -a  I  *  100  . 

(22) 


In  problem  22  I  means  that  the  expected  value  is  taken  with  respect  to 
the  joint  distribution  of  b,,  b..  c , ,  c ...  But  i,  is  independent  of  these  random 
variables  by  assumption  ff  and  i  ,  and  c..  are  independent  of  b,  and  b..  Hence 

Ml,!,)  E  ( «.,)*,  L, ,  i,  f 1 ,,  ,  ,(v  )  I,  E 1 1  ( c , )  X ,  20  X  t ,  where  the 

subscript  of  the  1  denotes  the  distribution  used  in  computing  the  expectation 
and  i  .  tj  means  that  the  conditional  distribution  of  t ,,  given  c(,  is  used. 

Also 

(  ,  i  *  -  f  I  i  .  (  !  -  <7  ,  I  k  ,  *  I  *><>  7  ,  -  ">  -  *  ,  I  I  . 

*  2  f  t  ,  N  j  ^  -  2  f  1  i '  ,  a  ,  h  j  1  ♦  i  ,a  1  —  7  "i  f  i  C  , 1  -  l  j  r (  i  ,  l , 


where  the  expectation  is  computed  using  the  joint  distribution  of  b , ,  b:,  t , ,  c. . 
However,  in  order  to  find  i,*  it  is  necessary  to  know  explicitly  the  expression 
for  1  U.t*)  as  a  function  of  Hence,  as  M  f.-)  Lt,  ,..U:)  f , , .  f , ,  , , ( i .) 
f  ,  ,(«.,-!)  19,  to  obtain  t,  the  following  must  be  solved: 


Maximize 
subject  to 


-'<1 1 ,  •  [-!»«,  *  k  1 


« ,  •-  l»o  'i-o.i  •  inn  . 


where  k  21L.b,)  2 1  (. .  o.h, )  *  1 501  (»■  .or.)  -  75 1  ( r .). 

Therefore 


i’  »  Iihi  i -,7,i  •  inn 


and 


l  -  J  I  -  a  ,  h  •  I  »<  *  r/  ,  •  l  J  »  -  1 1 M  •  a  j 

Since  P 1  ( 1  o  )  was  linear  in  b,t  ,s  seen  to  be  also  linear  in  b,.  which  agrees 

with  C'orollarv  2.  Also  if  o  , ,  o.  1,  i  *  is  piecewise  constant  (in  fact  it  is  a 
constant),  as  was  predicted  by  the  discussion  of  the  linear-programming- under- 
uncertaintv  case. 


1 1n  fact,  ironi  the  interpretation  of  the  general  problem,  it  is  clear  that  the 
vector  of  first -period  decisions,  will  always  have  I  (  \,i  This  is  another  direct 

consequence  >1  a-  a  /ero-order  decision  rule  (see  Section  *>). 
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Fxample  2 

For  our  second  example  the  following  problem  is  considered: 
maximize 

E ' ^ I  X1  *  C2  *2 ' 

subject  to 

,M  -  aM  *|  *  -  • 

P '  b2  ~  a2 1  *  ,  *  a22  *2  *  -  °2  1  (23) 

0. 


where  C|  and  c.  ire  defined  as  in  the  previous  example;  lb  and  b..  are  jointly 
normally  distributed  with  means  m,  and  m;  and  variances  6;  and  t>-:,  respectively, 
and  correlation  coefficient  o  ^  0;  a,,  and  a::  are  positive  constants  and  a  ,  is  a 
negative  constant;  and  c,  and  c:  are  independent  of  b,  and  b:. 

From  our  definition  of  b,  and  b  it  can  be  seen  that  their  joint  frequency 
function  is  given  by 

l 

f  h  h  i  -  - _  -  -  r  M1 

J-rA,  l-,73 


/h,-mA‘  /t),  -  mA/Sj 

V  .s-  7  "T7  -)\~)  T  .s-  7 


and  f(!>  b,  ),  the  conditional  frequency  function  of  b:  given  b, ,  satisfies 


where  |i(b,)  is  the  marginal  frequency  function  of  b,.  Therefore 


r  -| 

1 

1 

tS , 

-  p  -  (h  .m  » 

—  ~ 

b,  \  -  n  \  1-,)* 

1 

<*> 

i 

so  that  b  ,  given  b , ,  is  normally  distributed  with  mean  m.  *  p  (ft  /  ft,)  ( b,  - m , ) 
and  variance  ft  (1-p  ). 

Thus  f  *(l-a..)  ft.  1-p- ♦  '(l-o, )  *  m  -p  (6r/ 6,)»!.,  *  p  (ft./ ft,  )b,  ,  when 
4>(  .  )  is  the  distribution  function  of  a  normal  random  variable  with  mean  0  and 
variance  1.  Let  fci  ,  k  be  defined  by 


k  t  ,S  N  1  (t1-1  1  1  -  a  ,  > 


♦  m  . 


'ft 


i 


k 


i 


0 


Then  a  problem  equivalent  to  problem  23  is 
maximize 

iii  *  > .  i  > 

it  . 

subject  to 


(24) 
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Let  (bu  b.,  c,,  c- )  be  any  sample  point  in  the  space  of  random  variables 
b,  .  bJ(  f|,  c,.  Let  i,  be  a  given  first-period  decision  rule.  Then  to  find  i? 
the  following  must  be  solved: 

maximize 

i  ,  X 

subject  to 

a22  t2  k,  b,  *  k2  -  a.,  «  . 

« ,  -  i>, 


or,  equivalently, 
maximize 

subject  to 


I 


I 


" 


as  a::  0  by  hypothesis.  Since  c.  0,  also  by  assumption,  the  following  is 

obtained: 


no  franiblr  •••■liilion  if  It  (  ♦  hj  -  0  2  {  0 


Thus  region  0.  discussed  above  is  seen  to  be  defined  by  k,  b,  *  k,  -  a.,,1,  *  0. 
Moreover  since  au  0  it  can  be  seen  from  problem  24  that  D[  is  the  jegion 
for  which  6,  4>*l(l-fli|  )  ♦  m,  *  0,  so  that  the  union  of  Di  D.>  gives  the  set 

Using  the  rule  x?  0  in  A,,  the  optimal  i,  is  found  by  solving  the  following 


maximize 

f ilfiV  *  c2 «* » 

subject  to 

an  i,  ■_  <t>_l (  i  -  v  ♦  "i, 


If  />,  <J>  l(  1  -q  j )  *  m,  >  0,  then  a  feasible  i,  exists  and 


(25) 


But  l\.  is  the  region  where  k,  b,  *  k.  -  a. ,x,  *  0,  that  is,  b,  >  (a,,/k,  )  i,  -  (k  k,  ) 
as  k,  0  and  k,  is  a  constant  because  a,  is  a  constant  (see  Section  6).  If  it  is 
now  assumed  that  k  is  also  a  constant,  i.e.,  that  o.  is  a  constant,  the  following 
is  obtained: 


30 


Ej(  CjX*)  -  1  Of  (  —  b,  +  —  -  —  x,) - 

a..  kAfl22  °22  °22  '  f>,\j2rr 


-KV)! 

db, 


d21  ,  _^2_'  22 
fe,  1  ~  fe  , 


22  22  '  V 


■f  80 

19  / 


(fe ,  5,  k .  k,  o,i  \  - 

+  — - —  x.)e  2  dz  . 

^22  ^22  ' 


_  *  2 


fl21X!~k2-  mlfel  V  a2 


fe.5, 


Finally,  using  the  fact  that  x,  is  to  be  independent  of  b,  and  c, ,  problem 
25  can  be  written: 


maximize 


20 1 . 


19  a, 


“22 


L  V  k,  S,  /J  V  o,2  a22  /  [_  V  fe.S,  /J 


19 fe ,  5j  +■» 


/  r  <5(  z )  d  r 

a22  a21  x, -fe2-m,  fe1 

feT^ 


subjv^t  to 


•u 


1  <t>~1(l-a1)  +  mj, 


x,  >  0 


(26) 


where  o(  .  )  is  the  frequency  function  of  a  normal  random  variable  with  mean 
0  and  variance  1.  Solving  this  problem  (which  can  be  done  easily  by  applying 
the  Lagrange-multiplier  technique  to  problem  26),  the  following  is  obtained: 

*  5,  4>-1  (  1  -a  i)  +  m, 


Thus  the  optimal  decision  rules  are 

IS,  <t>“* ( 1  -a , )  +  m,  ,  -  . 

_i _ ! - 1  ,  if  S,  <t>  (  l-n,)  +  m,  £  0 

0,1 

0.  i.e.,  no  feasible  Xj  otherwise. 


X  ,  -  I  Al  b,  +  -!!i-  -  —  Xt  ,  if  b,  -  X,  -  and  X,  is  feasible 


0,  i.e.,  no  feasible  x2  otherwise. 
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